NSF - CAREER : The Listening Machine Annual Report 2004
نویسنده
چکیده
This year, we expanded our investigation of sound analysis to look at a range of different sound 'scenes', including overlapping conversations, ambient everyday sound, and music. In each case, the goal is to abstract useful information similar to that which a human listener would perceive, and in particular to deal successfully with the issues raised by multiple, overlapping sound sources. Our most focused effort was the continued development of the novel model for sound sources we proposed last year, based on treating each spectral instant as a simple deformation of its immediate predecessor (or, in general, its neighbors). This model decomposes smoothly-varying segments of sound into a single spectral profile, and a set of locally-smooth transformation functions, describing how the spectral detail is derived from its predecessors. This year, we extended this model to a two-layer version with separate transformations applied to fine spectral structure (e.g. harmonics, to account for changes in pitch) and broader spectral structure (e.g. the formants of voice, which in general will move independently of the harmonics). The key to this model is the way that the parame
منابع مشابه
NSF - CAREER : The Listening Machine IIS - 0238301 Annual Report 2007 Daniel
We have continued our research into associating words with the soundtracks of recordings of natural environments. We have been working with a database of 1400 “consumer videos” (collected by our collaborators at Kodak) as well as with similar amateur videos downloaded from YouTube. Based on a provisional lexicon of 25 terms that consumers might use as search terms (“music”, “birthday”, “beach”)...
متن کاملNSF - CAREER : The Listening Machine Annual Report 2005
Continuing our broadened theme of machine listening in many contexts, in 2005 we conducted research into automatic extraction of information in complex sound mixtures, in 'personal audio' environmental recordings, from music audio, and for the sounds of marine mammals recorded underwater. 2005 saw the graduation of Manuel Reyes, the Ph.D. student supported by this project from the start. Manuel...
متن کاملNSF-CAREER: The Listening Machine IIS-0238301 2003–2008 Final Report
This six-year project started with the idea of applying sound recognition and separation techniques that had originated in speech recognition to a broader domain of environmental sound mixtures. As it proceeded, the work diversified into several distinct areas, reflecting the different directions of the graduate students primarily supported by the project: Manuel Reyes and Keansub Lee worked on...
متن کاملSpeeding up sum-of-squares for tensor decomposition and planted sparse vectors
We consider two problems that arise in machine learning applications: the problem of recovering aplanted sparse vector in a random linear subspace and theproblemofdecomposing a random low-rank overcomplete 3-tensor. For both problems, the best known guarantees are based on the sum-of-squares method. We develop new algorithms inspired by analyses of the sum-of-squares method. Our algorithms achi...
متن کاملSoftware in Science: a Report of Outcomes of the 2014 National Science Foundation Software Infrastructure for Sustained Innovation (si 2 ) Meeting
The second annual NSF Software Infrastructure for Sustained Innovation (SI) PI meeting took place in Arlington, VA February 24-25, 2014. It was hosted by Beth Plale, Indiana University; Douglas Thain, University of Notre Dame; and Matt Jones, National Center for Ecological Analysis and Synthesis. This report captures the challenges and outcomes emerging from the meeting over the four topic area...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005